Corpus: heb_newscrawl_2011_1M

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 91369 ו-
2 70308 ה-
3 54137 ש-
4 52100 מ-
5 49036 ב-
Top Character Bigrams
word rank frequency n-gram
1 18189 וה-
2 12059 המ-
3 11990 ומ-
4 10652 שה-
5 9890 וב-
Top Character Trigrams
word rank frequency n-gram
1 2843 והמ-
2 2247 כשה-
3 1870 ולה-
4 1726 והת-
5 1651 המו-
Top Character 4-Grams
word rank frequency n-gram
1 394 ולהת-
2 360 והמו-
3 298 www.-
4 285 ישרא-
5 271 כשהמ-
Top Character 5-Grams
word rank frequency n-gram
1 266 ישראל-
2 144 אנטי--
3 125 האינט-
4 122 האנטי-
5 109 אינטר-
5855 msec needed at 2018-03-07 17:41